Fast and accurate text classification via multiple linear discriminant projections
نویسندگان
چکیده
منابع مشابه
Linear Discriminant Text Classification in High Dimension
Linear Discriminant (LD) techniques are typically used in pattern recognition tasks when there are many (n >> 10) datapoints in low-dimensional (d < 10) space. In this paper we argue on theoretical grounds that LD is in fact more appropriate when training data is sparse, and the dimension of the space is extremely high. To support this conclusion we present experimental results on a medical tex...
متن کاملUsing Multiple Discriminant Analysis Approach for Linear Text Segmentation
Research on linear text segmentation has been an on-going focus in NLP for the last decade, and it has great potential for a wide range of applications such as document summarization, information retrieval and text understanding. However, for linear text segmentation, there are two critical problems involving automatic boundary detection and automatic determination of the number of segments in ...
متن کاملLinear Programming Approaches for Multiple-Class Discriminant and Classification Analysis
New linear programming approaches are proposed as nonparametric procedures for multiple-class discriminant and classification analysis. A new MSD model minimizing the sum of the classification errors is formulated to construct discriminant functions. This model has desirable properties because it is versatile and is immune to the pathologies of some of the earlier mathematical programming model...
متن کاملSeparable Linear Discriminant Classification
Linear discriminant analysis is a popular technique in computer vision, machine learning and data mining. It has been successfully applied to various problems, and there are numerous variations of the original approach. This paper introduces the idea of separable LDA. Towards the problem of binary classification for visual object recognition, we derive an algorithm for training separable discri...
متن کاملDiscriminant Projections Embedding for Nearest Neighbor Classification
In this paper we introduce a new embedding technique to linearly project labeled data samples into a new space where the performance of a Nearest Neighbor classifier is improved. The approach is based on considering a large set of simple discriminant projections and finding the subset with higher classification performance. In order to implement the feature selection process we propose the use ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The VLDB Journal The International Journal on Very Large Data Bases
سال: 2003
ISSN: 1066-8888,0949-877X
DOI: 10.1007/s00778-003-0098-9